acyclicity constraint
- Europe > Belgium > Brussels-Capital Region > Brussels (0.14)
- Asia > China > Anhui Province (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Information Technology (0.46)
- Health & Medicine (0.46)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (0.94)
- (4 more...)
Non-negative DAG Learning from Time-Series Data
This work aims to learn the directed acyclic graph (DAG) that captures the instantaneous dependencies underlying a multivariate time series. The observed data follow a linear structural vector autoregressive model (SVARM) with both instantaneous and time-lagged dependencies, where the instantaneous structure is modeled by a DAG to reflect potential causal relationships. While recent continuous relaxation approaches impose acyclicity through smooth constraint functions involving powers of the adjacency matrix, they lead to non-convex optimization problems that are challenging to solve. In contrast, we assume that the underlying DAG has only non-negative edge weights, and leverage this additional structure to impose acyclicity via a convex constraint. This enables us to cast the problem of non-negative DAG recovery from multivariate time-series data as a convex optimization problem in abstract form, which we solve using the method of multipliers. Crucially, the convex formulation guarantees global optimality of the solution. Finally, we assess the performance of the proposed method on synthetic time-series data, where it outperforms existing alternatives.
- Europe > Spain > Galicia > Madrid (0.05)
- North America > United States > New York > Monroe County > Rochester (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
- (2 more...)
Differentiable Structure Learning with Partial Orders T aiyu Ban Lyuzhou Chen Xiangyu Wang
Differentiable structure learning is a novel line of causal discovery research that transforms the combinatorial optimization of structural models into a continuous optimization problem. However, the field has lacked feasible methods to integrate partial order constraints, a critical prior information typically used in real-world scenarios, into the differentiable structure learning framework. The main difficulty lies in adapting these constraints, typically suited for the space of total orderings, to the continuous optimization context of structure learning in the graph space. To bridge this gap, this paper formalizes a set of equivalent constraints that map partial orders onto graph spaces and introduces a plug-and-play module for their efficient application. This module preserves the equivalent effect of partial order constraints in the graph space, backed by theoretical validations of correctness and completeness. It significantly enhances the quality of recovered structures while maintaining good efficiency, which learns better structures using 90% fewer samples than the data-based method on a real-world dataset. This result, together with a comprehensive evaluation on synthetic cases, demonstrates our method's ability to effectively improve differentiable structure learning with partial orders.
- Europe > Belgium > Brussels-Capital Region > Brussels (0.14)
- Asia > China > Anhui Province (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Information Technology (0.46)
- Health & Medicine (0.46)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Interpretable, multi-dimensional Evaluation Framework for Causal Discovery from observational i.i.d. Data
Velev, Georg, Lessmann, Stefan
Nonlinear causal discovery from observational data imposes strict identifiability assumptions on the formulation of structural equations utilized in the data generating process. The evaluation of structure learning methods under assumption violations requires a rigorous and interpretable approach, which quantifies both the structural similarity of the estimation with the ground truth and the capacity of the discovered graphs to be used for causal inference. Motivated by the lack of unified performance assessment framework, we introduce an interpretable, six-dimensional evaluation metric, i.e., distance to optimal solution (DOS), which is specifically tailored to the field of causal discovery. Furthermore, this is the first research to assess the performance of structure learning algorithms from seven different families on increasing percentage of non-identifiable, nonlinear causal patterns, inspired by real-world processes. Our large-scale simulation study, which incorporates seven experimental factors, shows that besides causal order-based methods, amortized causal discovery delivers results with comparatively high proximity to the optimal solution.
- Europe > Romania > București - Ilfov Development Region > Municipality of Bucharest > Bucharest (0.04)
- Asia > Middle East > Jordan (0.04)
- Europe > United Kingdom (0.04)
- Europe > Germany > Berlin (0.04)
- Education (0.34)
- Health & Medicine (0.31)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)
- (2 more...)
Non-negative Weighted DAG Structure Learning
Rey, Samuel, Saboksayr, Seyed Saman, Mateos, Gonzalo
We address the problem of learning the topology of directed acyclic graphs (DAGs) from nodal observations, which adhere to a linear structural equation model. Recent advances framed the combinatorial DAG structure learning task as a continuous optimization problem, yet existing methods must contend with the complexities of non-convex optimization. To overcome this limitation, we assume that the latent DAG contains only non-negative edge weights. Leveraging this additional structure, we argue that cycles can be effectively characterized (and prevented) using a convex acyclicity function based on the log-determinant of the adjacency matrix. This convexity allows us to relax the task of learning the non-negative weighted DAG as an abstract convex optimization problem. We propose a DAG recovery algorithm based on the method of multipliers, that is guaranteed to return a global minimizer. Furthermore, we prove that in the infinite sample size regime, the convexity of our approach ensures the recovery of the true DAG structure. We empirically validate the performance of our algorithm in several reproducible synthetic-data test cases, showing that it outperforms state-of-the-art alternatives.
- Europe > Spain > Galicia > Madrid (0.05)
- North America > United States > New York > Monroe County > Rochester (0.04)
Kernel-Based Differentiable Learning of Non-Parametric Directed Acyclic Graphical Models
Liang, Yurou, Zadorozhnyi, Oleksandr, Drton, Mathias
Causal discovery amounts to learning a directed acyclic graph (DAG) that encodes a causal model. This model selection problem can be challenging due to its large combinatorial search space, particularly when dealing with non-parametric causal models. Recent research has sought to bypass the combinatorial search by reformulating causal discovery as a continuous optimization problem, employing constraints that ensure the acyclicity of the graph. In non-parametric settings, existing approaches typically rely on finite-dimensional approximations of the relationships between nodes, resulting in a score-based continuous optimization problem with a smooth acyclicity constraint. In this work, we develop an alternative approximation method by utilizing reproducing kernel Hilbert spaces (RKHS) and applying general sparsity-inducing regularization terms based on partial derivatives. Within this framework, we introduce an extended RKHS representer theorem. To enforce acyclicity, we advocate the log-determinant formulation of the acyclicity constraint and show its stability. Finally, we assess the performance of our proposed RKHS-DAGMA procedure through simulations and illustrative data analyses.
- North America > United States > Florida > Palm Beach County > Boca Raton (0.04)
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- North America > United States > New York (0.04)
- (3 more...)
TS-CausalNN: Learning Temporal Causal Relations from Non-linear Non-stationary Time Series Data
Faruque, Omar, Ali, Sahara, Zheng, Xue, Wang, Jianwu
The growing availability and importance of time series data across various domains, including environmental science, epidemiology, and economics, has led to an increasing need for time-series causal discovery methods that can identify the intricate relationships in the non-stationary, non-linear, and often noisy real world data. However, the majority of current time series causal discovery methods assume stationarity and linear relations in data, making them infeasible for the task. Further, the recent deep learning-based methods rely on the traditional causal structure learning approaches making them computationally expensive. In this paper, we propose a Time-Series Causal Neural Network (TS-CausalNN) - a deep learning technique to discover contemporaneous and lagged causal relations simultaneously. Our proposed architecture comprises (i) convolutional blocks comprising parallel custom causal layers, (ii) acyclicity constraint, and (iii) optimization techniques using the augmented Lagrangian approach. In addition to the simple parallel design, an advantage of the proposed model is that it naturally handles the non-stationarity and non-linearity of the data. Through experiments on multiple synthetic and real world datasets, we demonstrate the empirical proficiency of our proposed approach as compared to several state-of-the-art methods. The inferred graphs for the real world dataset are in good agreement with the domain understanding.
- North America > United States > Maryland > Baltimore County (0.14)
- North America > United States > Maryland > Baltimore (0.14)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- (3 more...)
- Research Report > New Finding (0.46)
- Research Report > Promising Solution (0.34)
- Energy (1.00)
- Government > Regional Government > North America Government > United States Government (0.67)
Multi-granularity Causal Structure Learning
Liang, Jiaxuan, Wang, Jun, Yu, Guoxian, Xia, Shuyin, Wang, Guoyin
However, these algorithms simply deem causal relationships stand exclusively at the level of individual variables Data science is moving from the data-centric paradigm forward (micro-variable), ignoring the collective interactions from the science-centric paradigm, and causal revolution multiple variables (macro-variable). For instance, the brain is sweeping across various research fields. Causality learning can be characterized at a micro granularity of neurons and endeavors to unearth causal relationships among variables their synapses, but high-order synergistic subsystems are from observational data and generate causal graph, widespread, which typically sit between canonical functional that is, directed acyclic graph (DAG). Unlike correlationbased networks and may serve an integrative role (Varley study, causality analysis reveals the causal mechanism et al. 2023). Actually, observational data can be regarded of data generation. Identifying causality holds paramount as knowledge in the lowest granularity level, while knowledge significance for stable inference and rational decisions can be regarded as the abstraction of data at different in many applications, such as recommendation systems granularity levels (Wang 2017; Wang et al. 2022). Similar (Wang et al. 2020), medical diagnostics (Richens, Lee, and viewpoints appear in the research of complex systems, Johri 2020), epidemiology (Vandenbroucke, Broadbent, and which suggests that causal relationship is more pronounced Pearce 2016) and many others (Von Kügelgen et al. 2022).
- Asia > China > Chongqing Province > Chongqing (0.05)
- Asia > China > Shandong Province > Jinan (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.83)
Stable Differentiable Causal Discovery
Nazaret, Achille, Hong, Justin, Azizi, Elham, Blei, David
Mathematically, a set of causal relations can be represented with a directed acyclic graph (DAG) where nodes are variables, and directed edges indicate direct causal effects. The goal of causal discovery is to recover the graph from the observed data. The data can either be interventional, where some variables were purposely manipulated, or purely observational, where there has been no manipulation. The challenge of causal discovery is the search for the true DAGs is NP-hard. Exact methods are intractable, even for modest numbers of variables [Chi96]. Yet datasets in fields like biology routinely involve thousands of variables [Dix+16]. To address this problem, Zheng et al. [Zhe+18] introduced Differentiable Causal Discovery (DCD), which formulates the DAG search as a continuous optimization over the space of all graph adjacency matrices. An essential element of this strategy is an acyclicity constraint, in the form of a penalty, that guides an otherwise unconstrained search toward acyclic graphs.
- North America > United States > New York (0.04)
- North America > United States > California (0.04)